Hey, I'm looking to compile info from several different PDFs into a single text file, and I'm wondering if there is a quick way to de-columnize the text I cut and paste. I'm going to be mixing different original font sizes and column widths, so I want to wipe out most of the original formatting so the text runs in normal straight lines from margin to margin.
I've got MS Office and Sun OpenOffice.
Thanks in advance for the help (unless no one helps, in which case a pox on you all!!!!)
formatting issue
The internet gave a voice to the world thus gave definitive proof that the world is mostly full of idiots.
- Josh_Kablack
- King
- Posts: 5317
- Joined: Fri Mar 07, 2008 7:54 pm
- Location: Online. duh
Save as plain text, or just paste into Notepad (or another plain-text editor).
"But transportation issues are social-justice issues. The toll of bad transit policies and worse infrastructure—trains and buses that don’t run well and badly serve low-income neighborhoods, vehicular traffic that pollutes the environment and endangers the lives of cyclists and pedestrians—is borne disproportionately by black and brown communities."
Thanks for the tips, but neither of them will eliminate the existing line breaks from the columns. I'm trying to avoid having to remove them manually from a hundred pages of combined text from three books.
I'm trying to compile all the character generation info for GURPS 4th edition into a single text rather than shuffling through 5 books to find a slight variation on the same ability or skill.
I have managed to finagle the fonts and column size on the page to look about right, though.
As a second query: if I were to print a PDF but wanted to get rid of the margin art, is there an easy way to do that? The GURPS margin art is just a solid dark-colored band, which is both ugly and a waste of ink.
Last edited by ckafrica on Thu Dec 04, 2008 4:40 am, edited 1 time in total.
- Josh_Kablack
Here's a help page for Open Office
Oddly, you can use search and replace with the \n regular expression to replace characters with hard line breaks, but it doesn't seem to work to search for hard line breaks and replace them with other characters.
Removing Line Breaks
Use the AutoFormat feature to remove line breaks that occur within sentences. Unwanted line breaks can occur when you copy text from another source and paste it into a text document.
This AutoFormat feature only works on text that is formatted with the "Default" paragraph style.
1. Choose Tools - AutoCorrect.
2. On the Options tab, ensure that "Combine single line paragraphs if length greater than 50%" is selected. To change the minimum percentage for the line length, double-click the option in the list, and then enter a new percentage.
3. Click OK.
4. Select the text containing the line breaks that you want to remove.
5. In the Apply Style box on the Formatting bar, choose Default.
6. Choose Format - AutoFormat - Apply.
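If you'd rather script it than click through menus, here is a rough Python sketch of the same kind of rule as that "combine single line paragraphs" option. To be clear, this is only my guess at the idea, not how OpenOffice actually implements it: a line longer than some fraction of the longest line is assumed to be a wrapped line and gets joined to the next one, while a shorter line is assumed to end a paragraph. The function name join_long_lines and the threshold parameter are made up for illustration.

```python
def join_long_lines(text, threshold=0.5):
    """Join a line to the following one when it is longer than
    threshold * (length of the longest line). Shorter lines and
    blank lines are treated as the end of a paragraph.
    This is an assumed heuristic, not OpenOffice's own algorithm."""
    lines = [ln.strip() for ln in text.splitlines()]
    width = max((len(ln) for ln in lines), default=0)
    paragraphs, current = [], []
    for ln in lines:
        if not ln:
            # Blank line: close whatever paragraph is open.
            if current:
                paragraphs.append(" ".join(current))
                current = []
            continue
        current.append(ln)
        if len(ln) <= threshold * width:
            # Short line: assume the paragraph ends here.
            paragraphs.append(" ".join(current))
            current = []
    if current:
        paragraphs.append(" ".join(current))
    return "\n".join(paragraphs)


if __name__ == "__main__":
    pasted = (
        "Strength is the measure of\n"
        "your physical power and\n"
        "bulk.\n"
        "Dexterity covers agility and\n"
        "fine motor ability."
    )
    print(join_long_lines(pasted))
```

Like the AutoFormat option, this will guess wrong on short headings and on paragraphs whose last line happens to be long, so expect to do a little cleanup by hand afterwards.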
And as further options, you might consider:
the Linux fmt command (if you have easy access to the right distro), or learning enough Perl to write a script that removes hard line breaks occurring before you've counted a block of text N characters long.
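As a rough illustration of the script idea, here is a minimal sketch in Python rather than Perl (my substitution, since the standard textwrap module does most of the work). It assumes that blank lines in the pasted text mark real paragraph breaks and that every other line break is a leftover from the PDF columns; the function name reflow and the default width of 72 are just placeholders.

```python
import re
import textwrap


def reflow(text, width=72):
    """Treat blank lines as paragraph breaks, drop every other hard
    line break, and re-wrap each paragraph to `width` characters."""
    paragraphs = re.split(r"\n\s*\n", text.strip())
    wrapped = []
    for para in paragraphs:
        # Collapse all runs of whitespace, newlines included, to one space.
        flat = " ".join(para.split())
        wrapped.append(textwrap.fill(flat, width=width))
    return "\n\n".join(wrapped)


if __name__ == "__main__":
    pasted = (
        "Advantages are useful\n"
        "traits that cost\n"
        "character points.\n"
        "\n"
        "Disadvantages give\n"
        "points back."
    )
    print(reflow(pasted, width=60))
```

If the pasted text has no blank lines between paragraphs, everything will come out as one long paragraph, so this only helps when the copy and paste preserves paragraph gaps. With GNU fmt, something like `fmt -w 72` on the saved text file does a similar re-wrap without any scripting.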
Last edited by Josh_Kablack on Thu Dec 04, 2008 5:30 am, edited 1 time in total.
